
[BLIP] Fix daily CI failing test #20877

Merged
younesbelkada merged 9 commits into huggingface:main from younesbelkada:blip-fix-tolerance
Jan 5, 2023

Conversation

@younesbelkada
Contributor

What does this PR do?

This PR fixes: https://github.com/huggingface/transformers/actions/runs/3754402958/jobs/6378634199

Why is this fix relevant?

The reference logits for this test were obtained under pytorch==1.13.1+cu116, while the daily CI uses pytorch==1.13.0+cu116. Setting the tolerance slightly higher (4e-2) fixes the test and makes it compatible across versions.
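For illustration only (the drifted value below is made up, not an actual test value): with atol=4e-2, logits that differ by up to about 0.04 between torch builds still compare equal.

import torch

# Illustrative sketch of what a 4e-2 absolute tolerance absorbs; the drifted number is hypothetical.
reference = torch.tensor([[0.5053]])  # reference value recorded under pytorch==1.13.1+cu116
observed = torch.tensor([[0.5321]])   # hypothetical drifted value under pytorch==1.13.0+cu116
print(torch.allclose(observed, reference, atol=4e-2))  # True: |0.5321 - 0.5053| = 0.0268 < 0.04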

cc @LysandreJik @sgugger @ydshieh

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Dec 22, 2022

The documentation is not available anymore as the PR was closed or merged.

Collaborator

@sgugger left a comment

That is a very big tolerance. It would be better to identify the layer in the model causing this problem.
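One possible way to narrow this down (not part of this PR, and assuming the checkpoint used by the integration test is Salesforce/blip-itm-base-coco) would be to register forward hooks that dump every module's output, so the per-layer drift between torch versions can be compared:

import torch
from transformers import BlipForImageTextRetrieval

# Hypothetical debugging sketch: record each module's output so that dumps produced
# under two torch versions can be diffed layer by layer.
model = BlipForImageTextRetrieval.from_pretrained("Salesforce/blip-itm-base-coco").eval()
activations = {}

def make_hook(name):
    def hook(module, args, output):
        if isinstance(output, torch.Tensor):
            activations[name] = output.detach().cpu()
    return hook

for name, module in model.named_modules():
    module.register_forward_hook(make_hook(name))

# Run the same inputs as the failing test, then torch.save(activations, ...) and
# compare the two files to find the first layer whose outputs diverge.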

@younesbelkada
Contributor Author

younesbelkada commented Dec 26, 2022

Hmm, at first I thought that the Softmax was causing the issue and leading to large rounding errors, but the test passes locally with torch==1.13.0+cu116 and does not pass on the Docker image that uses the same version. Will investigate more!


# Assertions under discussion, compared against reference values with atol/rtol of 1e-3
# (the last line uses torch.allclose's default tolerances).
self.assertTrue(torch.allclose(torch.nn.Softmax()(out_itm[0].cpu()), expected_scores, atol=1e-3, rtol=1e-3))
self.assertTrue(torch.allclose(out[0].cpu(), torch.Tensor([[0.5053]]), atol=1e-3, rtol=1e-3))
self.assertTrue(torch.allclose(out_itm[0][0][0].cpu(), expected_scores))
Collaborator

It would be great if we could figure out why the previous test logic failed across environments.
Let me know if I can help here, @younesbelkada :-)

@ydshieh
Collaborator

ydshieh commented Jan 2, 2023

On GCP (my own/ CI runners), all torch versions give

(torch 1.13.x)

[[0.97982633 0.02017363]]
[[0.50528485]]

or (torch 1.12.1)

[[0.97982633 0.02017365]]
[[0.5052849]]

so

[[0.9798, 0.0202]]
[[0.5053]]

will work. Not sure why you got a larger difference though; it is likely an env issue.
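As a quick sanity check (values copied from above; not part of the PR), both environments agree with the rounded references well within a 1e-3 tolerance:

import torch

# Values copied from the comment above; both torch versions match the rounded
# references with plenty of margin at atol=1e-3.
out_113 = torch.tensor([[0.97982633, 0.02017363]])  # torch 1.13.x
out_112 = torch.tensor([[0.97982633, 0.02017365]])  # torch 1.12.1
expected_scores = torch.tensor([[0.9798, 0.0202]])

print(torch.allclose(out_113, expected_scores, atol=1e-3))  # True
print(torch.allclose(out_112, expected_scores, atol=1e-3))  # True
print(torch.allclose(torch.tensor([[0.50528485]]), torch.tensor([[0.5053]]), atol=1e-3))  # True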

younesbelkada and others added 5 commits January 4, 2023 19:23
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- add model.eval
- fix tolerance for GPU devices
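A minimal sketch of what those two commit items typically amount to in a slow integration test (the checkpoint, image URL, and prompt below are assumptions for illustration, not the actual diff):

import requests
import torch
from PIL import Image
from transformers import BlipForImageTextRetrieval, BlipProcessor

# Illustrative only: model.eval() removes dropout noise, and the resulting scores
# are then compared against reference values with a small tolerance in the test.
checkpoint = "Salesforce/blip-itm-base-coco"
processor = BlipProcessor.from_pretrained(checkpoint)
model = BlipForImageTextRetrieval.from_pretrained(checkpoint)
model.eval()

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
inputs = processor(images=image, text="a photo of two cats", return_tensors="pt")

with torch.no_grad():
    out_itm = model(**inputs)
scores = torch.nn.Softmax(dim=1)(out_itm[0])
print(scores)  # compared against expected_scores with e.g. atol=1e-3 in the test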
@younesbelkada
Contributor Author

younesbelkada commented Jan 4, 2023

Thanks so much @ydshieh 💯, the tests now pass on the CI Docker image with your suggested values!
Seems that something was wrong with my env indeed.

@younesbelkada younesbelkada requested a review from ydshieh January 4, 2023 19:47
@younesbelkada younesbelkada requested a review from sgugger January 5, 2023 08:15
Collaborator

@ydshieh left a comment

Nice 💯 and thank you!

@younesbelkada younesbelkada merged commit bf82c9b into huggingface:main Jan 5, 2023
silverriver pushed a commit to silverriver/transformers that referenced this pull request Jan 6, 2023
